Comparing compact codebooks for visual categorization

نویسندگان

  • Jan C. van Gemert
  • Cees Snoek
  • Cor J. Veenman
  • Arnold W. M. Smeulders
  • Jan-Mark Geusebroek
چکیده

In the face of current large-scale video libraries, the practical applicability of content-based indexing algorithms is constrained by their efficiency. This paper strives for efficient large-scale video indexing by comparing various visual-based concept categorization techniques. In visual categorization, the popular codebook model has shown excellent categorization performance. The codebook model represents continuous visual features by discrete prototypes predefined in a vocabulary. The vocabulary size has a major impact on categorization efficiency, where a more compact vocabulary is more efficient. However, smaller vocabularies typically score lower on classification performance than larger vocabularies. This paper compares four approaches to achieve a compact codebook vocabulary while retaining categorization performance. For these four methods, we investigate the trade-off between codebook compactness and categorization performance. We evaluate the methods on more than 200 hours of challenging video data with as many as 101 semantic concepts. The results allow us to create a taxonomy of the four methods based on their efficiency and categorization performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Création de Vocabulaires Visuels Efficaces pour la Catégorisation d’Images. Creating Efficient Visual Codebooks for Image Categorization

We propose in this article an automatic method for building visual codebooks. Codebooks are obtained by quantizing local image descriptors and are used to automatically build discriminative representations of objects occuring in images. We describe an image categorization application based on the proposed approaches, providing results far above related state of the art existing methods.

متن کامل

Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines

Recently, the coding of local features (e.g. SIFT) for image categorization tasks has been extensively studied. Incorporated within the Bag of Words (BoW) framework, these techniques optimize the projection of local features into the visual codebook, leading to state-of-theart performances in many benchmark datasets. In this work, we propose a novel visual codebook learning approach using the r...

متن کامل

Speeded-up and Compact Visual Codebook for Object Recognition

The well known framework in the object recognition literature uses local information extracted at several patches in images which are then clustered by a suitable clustering technique. A visual codebook maps the patch-based descriptors into a fixed-length vector in histogram space to which standard classifiers can be directly applied. Thus, the construction of a codebook is an important step wh...

متن کامل

Kernel Codebooks for Scene Categorization

This paper introduces a method for scene categorization by modeling ambiguity in the popular codebook approach. The codebook approach describes an image as a bag of discrete visual codewords, where the frequency distributions of these words are used for image categorization. There are two drawbacks to the traditional codebook model: codeword uncertainty and codeword plausibility. Both of these ...

متن کامل

Analysis of Visual Impacts in Compact City’s Form

Desired physical form of cities has been noticeable since the beginning of urbanization, from old patterns of early civilizations to the latest urbanism’s theories, which offered to build better cities. The opinions in recent decades have expressed that compact physical form of cities is a better form than sprawl form to achieve urban sustainability. The form of the city is the embodiment of it...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Vision and Image Understanding

دوره 114  شماره 

صفحات  -

تاریخ انتشار 2010